Network Performance in High Performance Linux Clusters

نویسندگان

  • Ben Huang
  • Michael Anthony Bauer
  • Michael Katchabaw
چکیده

Linux-based clusters have become more prevalent as a foundation for High Performance Computing (HPC) systems. With a better understanding of network performance in these environments, we can optimize configurations and develop better management and administration policies to improve operations. To assist in this process, we developed a network measurement tool to measure UDP, TCP and MPI communications over high performance networks, such as Gigabit Ethernet and Myrinet. In this paper, we report on the use of this tool to evaluate the network performance of three high performance interconnects in HPC clusters: Gigabit Ethernet, Myrinet, and Quadrics’ QsNet and discuss the implications of those results for configurations in HPC Linux clusters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Network Performance in Distributed HPC Clusters

Linux-based clusters have become prevalent as a foundation for High Performance Computing (HPC) systems. As these clusters become more affordable and available, and with the emergence of high speed networks, it is becoming more feasible to create HPC grids consisting of multiple clusters. One of the attractions of such grids is the potential to scale applications across the various clusters. Th...

متن کامل

Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.

Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...

متن کامل

Performance Considerations for Network Switch Fabrics on Linux Clusters

One of the most significant components in a cluster is the interconnection network between computational nodes. A majority of today’s clusters use either switched Fast Ethernet, Gigabit Ethernet, or a specialized switch fabric to connect nodes. However, the use of these specialized switch fabrics may not necessarily benefit the users, and in some cases they perform only slightly better than com...

متن کامل

PVFS: A Parallel File System for Linux Clusters

As Linux clusters have matured as platforms for lowcost, high-performance parallel computing, software packages to provide many key services have emerged, especially in areas such as message passing and networking. One area devoid of support, however, has been parallel file systems, which are critical for highperformance I/O on such clusters. We have developed a parallel file system for Linux c...

متن کامل

Enhancing TCP Performance for Dedicated Clusters and Grids

TCP congestion control methods seriously and unnecessarily harm performance of network transmissions when used in dedicated clusters and grids. We present a simple method in which congestion control can be disabled under appropriate circumstances while still addressing fairness issues and avoiding congestion collapse. We discuss a Linux-based implementation of this “Rude TCP”1 and demonstrate t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005